Complete Genome Sequence of Prepandemic Vibrio parahaemolyticus 
BB220P 

Roderick V. Jensen, a Saylem M. DePasquale, a Elizabeth A. Harbolick, a Tian Hong, a Alison L. Kernell, a David H. Kruchko, a 
Thero Modise, a Cimarron E. Smith, a Linda L. McCarter, b Ann M. Stevens a 

Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, USA a; Department of Microbiology, University of Iowa, Iowa City, Iowa, USA b 
S.M.D., E.A.H., T.H., A.L.K., D.H.K., T.M., and C.E.S. contributed equally to this work. 

The number of inflammatory gastroenteritis outbreaks due to the food-borne pathogen Vibrio parahaemolyticus is rising 
sharply worldwide and in the United States in particular. Here we report the complete, annotated genome sequence of the 
prepandemic V. parahaemolyticus strain BB220P and make initial comparisons with the complete genome sequence of the 
pandemic strain RIMD2210633. 
Vibrio parahaemolyticus is found in marine-associated aquatic 
environments, either free-living, attached to abiotic and biotic (e.g., 
plankton and shellfish) surfaces, or associated with marine animal 
hosts (1, 2). It is also a major cause of seafood-borne gastroenteritis 
in humans, occasionally causing death (2-4). Since the 
complete genome sequence of clinical isolate RIMD2210633 was 
published (5), draft genome sequences for several other 
V. parahaemolyticus strains have been announced (6-8) or are available 
online (http://www.genomesonline.org). Here we announce the 
complete genome sequence of V. parahaemolyticus BB220P, a 
Bangladesh environmental isolate from the early 1980s (9). This 
strain, with an O4:K8 serotype, has been extensively studied with 
respect to swimming and swarming motility, biofilm development, 
and phase variation (10-15). 

A single opaque colony of the V. parahaemolyticus BB220P 
strain LM5312 (16) was grown in heart infusion (HI) broth (13) at 
30°C overnight. The Qiagen DNeasy blood and tissue kit protocol 
for Gram-negative bacteria was used to extract DNA, which 
was sequenced using a Roche/454 GS FLX Titanium system. 
Coverage of 200× was achieved, and de novo sequence 
assembly of the reads was then performed using Roche/454 Newbler 
software. A second de novo assembly with MIRA 
(http://www.chevreux.org/projects_mira.html) was performed to help 
fill gaps in the genome assembly. Alignment to the RIMD2210633 reference 
genome sequence using the Lasergene 8 software package also 
facilitated complete genome assembly into circular chromosomes 
1 and 2 of lengths 3.297 Mbp and 1.806 Mbp, respectively. When 
necessary, primers were designed and traditional Sanger sequencing 
was performed to confirm putative assembly junctions. The 
BB220P genome contains eleven ~5-kbp rRNA/tRNA clusters at 
the same locations as those in RIMD2210633. Since unique 
Roche/454 or Sanger sequences could not span these regions, the 
corresponding RIMD2210633 sequences were used to help fill 
these gaps. 
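
The figures quoted above can be combined in a short back-of-the-envelope calculation. The sketch below is illustrative only: it uses the chromosome lengths, the 200× coverage, and the eleven ~5-kbp rRNA/tRNA clusters as reported, and it assumes each cluster is exactly 5 kbp, which the text gives only as an approximation. The variable and function names are hypothetical.

```python
# Values quoted in the text (Mbp = million base pairs).
CHR1_MBP = 3.297
CHR2_MBP = 1.806
COVERAGE = 200           # fold coverage reported for the 454 run
RRNA_CLUSTERS = 11       # rRNA/tRNA clusters filled from the reference
RRNA_CLUSTER_KBP = 5.0   # ASSUMPTION: "~5 kbp" treated as exactly 5 kbp


def genome_size_mbp(chromosome_lengths):
    """Total genome size as the sum of chromosome lengths."""
    return sum(chromosome_lengths)


def implied_read_bases_mbp(genome_mbp, coverage):
    """Coverage is total sequenced bases divided by genome size."""
    return genome_mbp * coverage


genome = genome_size_mbp([CHR1_MBP, CHR2_MBP])
reads = implied_read_bases_mbp(genome, COVERAGE)
reference_filled_mbp = RRNA_CLUSTERS * RRNA_CLUSTER_KBP / 1000.0
reference_fraction = reference_filled_mbp / genome

print(f"Genome size: {genome:.3f} Mbp")
print(f"Implied sequenced bases: about {reads / 1000:.2f} Gbp")
print(f"Sequence filled from the reference: about {reference_fraction:.1%} of the genome")
```

Under these assumptions, the regions taken from RIMD2210633 amount to roughly one percent of the assembled genome.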

The genome sequence was annotated on the Rapid Annotation 
using Subsystem Technology (RAST) server (http://rast.nmpdr.org/) 
with the help of the GenBank annotation of RIMD2210633. 
Annotation of sequences unique to BB220P was performed using 
both GeneMark-P* and GeneMark.hmm-P (http://exon.biology.gatech.edu). 
Coding sequences for 2,973 genes on chromosome 1 
and 1,653 genes on chromosome 2 were identified. Overall, there 
is extensive homology between the two strains; >90% of the coding 
sequences have the same annotation, length, and relative position. 
There are ~300 genes novel to BB220P and ~400 genes 
novel to RIMD2210633. Many of the novel genes appear to be 
remnants of transposons or phages. With respect to potential virulence 
traits, and like RIMD2210633, the BB220P genome encodes 
thermostable direct hemolysin (Tdh) and two type-3 and 
two type-6 secretion systems, and it lacks genes encoding thermostable 
direct hemolysin-related hemolysin (Trh) and urease (Ure) 
(17). BB220P also lacks prophage f237 (18) and genomic islands 
VPaI-1 and VPaI-3 to VPaI-6 (19). The two strains differ in the 
superintegron on chromosome 1 (~90 genes) and in two small 
blocks encoding the O and K antigens (~12 genes each). Because 
BB220P is a prepandemic isolate, its genome sequence will 
provide an important anchor for future comparative genomic and 
virulence studies. 
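
As a consistency check on the counts above, the following sketch totals the coding sequences and estimates what share of BB220P genes is strain specific. It treats the approximate figure of ~300 novel genes as exact, which is an assumption made only for illustration, and the names used are hypothetical.

```python
# Gene counts quoted in the text.
GENES_CHR1 = 2973
GENES_CHR2 = 1653
NOVEL_TO_BB220P = 300  # ASSUMPTION: "~300" treated as exactly 300

total_genes = GENES_CHR1 + GENES_CHR2
novel_fraction = NOVEL_TO_BB220P / total_genes
shared_fraction = 1.0 - novel_fraction

print(f"Total coding sequences: {total_genes}")
print(f"Novel to BB220P: about {novel_fraction:.1%}")
print(f"Not novel to BB220P: about {shared_fraction:.1%}")
```

The share of genes that are not strain specific comes out above 90%, in line with the reported level of homology, although "same annotation, length, and relative position" is a stricter criterion than simply having a counterpart in RIMD2210633.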

Nucleotide sequence accession numbers. The complete, annotated 
genome sequence for V. parahaemolyticus BB220P 
strain LM5312 is available in GenBank under accession numbers 
CP003972 (BB220PChr1) and CP003973 (BB220PChr2). 
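
For readers who want to retrieve the records programmatically, one possible route is the NCBI E-utilities `efetch` endpoint. This is not part of the original announcement; the sketch below only builds the request URLs for the two accession numbers and makes no network call. The helper name `efetch_url` is hypothetical.

```python
from urllib.parse import urlencode

# NCBI E-utilities efetch endpoint (an assumption about how a reader might
# retrieve the records; the announcement itself only lists the accessions).
EFETCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"

# Accession numbers given in the text.
ACCESSIONS = {"chromosome 1": "CP003972", "chromosome 2": "CP003973"}


def efetch_url(accession, rettype="gb"):
    """Build an efetch URL for a nucleotide record in GenBank flat-file format."""
    params = {
        "db": "nuccore",
        "id": accession,
        "rettype": rettype,
        "retmode": "text",
    }
    return f"{EFETCH_BASE}?{urlencode(params)}"


for label, accession in ACCESSIONS.items():
    print(label, efetch_url(accession))
```

Passing `rettype="fasta"` instead should return the sequence alone rather than the annotated record.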